A viewing and processing tool for the analysis of a comparable corpus of Kiranti mythology

نویسندگان

  • Aimée Lahaussois
  • Séverine Guillaume
چکیده

This presentation describes a trilingual corpus of three endangered languages of the Kiranti group (Tibeto-Burman family) from Eastern Nepal. The languages, which are exclusively oral, share a rich mythology, and it is thus possible to build a corpus of the same native narrative material in the three languages. The segments of similar semantic content are tagged with a "similarity" label to identify correspondences among the three language versions of the story. An interface has been developed to allow these similarities to be viewed together, in order to allow make possible comparison of the different lexical and morphosyntactic features of each language. A concordancer makes it possible to see the various occurrences of words or glosses, and to further compare and contrast the languages.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Corpus based coreference resolution for Farsi text

"Coreference resolution" or "finding all expressions that refer to the same entity" in a text, is one of the important requirements in natural language processing. Two words are coreference when both refer to a single entity in the text or the real world. So the main task of coreference resolution systems is to identify terms that refer to a unique entity. A coreference resolution tool could be...

متن کامل

آثار ارزیابی شناختی و سرکوبگری هیجانی بر واکنشهای عصبی خودکار بر اساس حساسیت پردازش حسی

Objectives The aim of this study was to evaluate the effect of emotion regulation strategies of cognitive appraisal and emotional suppression on autonomic nervous reactions based on high and low sensory processing sensitivity among students. Methods For this purpose, 500 students of Bu Ali Sina University of Hamadan were selected through a stratified sampling approach. Based on final score dis...

متن کامل

Hedges in English for Academic Purposes: A Corpus-based study of Iranian EFL learners

Hedges, as tools to express tentativeness and doubt, have been studied in plenty of research papers in the Iranian EFL research setting. However, their use in a learner corpus, portraying Iranian learner English, is in need of more research attention. With this end in view, this study aimed at investigating how Iranian EFL learners who have majored in English-related fields in Iran deployed hed...

متن کامل

Psychological Analysis of Kiumarth Myth in the Light of the Personality Psychology of Jung

Mythology allocated a large part, fundamental and effectively to the human mind. The knowledge of mythology in fact recognizes the important infrastructure of ideas, culture and civilization. One of the most common ways to study mythology is to implement psychological ideas in mythology. The result is not only a better understanding of mythology, but also a better understanding of human psyche ...

متن کامل

Producing a Persian Text Tokenizer Corpus Focusing on Its Computational Linguistics Considerations

The main task of the tokenization is to divide the sentences of the text into its constituent units and remove punctuation marks (dots, commas, etc.). Each unit is a continuous lexical or grammatical writing chain that is an independent semantic unit. Tokenization occurs at the word level and the extracted units can be used as input to other components such as stemmer. The requirement to create...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012